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Abstract 



0\ 

o 
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In this paper, we consider a scenario where a source node wishes to broadcast two confidential messages to two receivers, 
while a wire-tapper also receives the transmitted signal. This model is motivated by wireless communications, where individual 
secure messages are broadcast over open media and can be received by any illegitimate receiver. The secrecy level is measured 
by the equivocation rate at the eavesdropper. We first study the general (non-degraded) broadcast channel with an eavesdropper. 
We present an inner bound on the secrecy capacity region for this model. This inner bound is based on a combination of random 
binning, and the Gelfand-Pinsker binning. We further study the situation in which the channels are degraded. For the degraded 
broadcast channel with an eavesdropper, we present the secrecy capacity region. Our achievable coding scheme is based on 
Cover's superposition scheme and random binning. We refer to this scheme as the Secret Superposition Scheme. Our converse 
proof is based on a combination of the converse proof of the conventional degraded broadcast channel and Csiszar Lemma. We 
then assume that the channels are Additive White Gaussian Noise (AWGN) and show that the Secret Superposition Scheme with 
Gaussian codebook is optimal. The converse proof is based on Costa's entropy power inequality. Finally, we use a broadcast 
strategy for the slowly fading wire-tap channel when only the eavesdropper's channel is fixed and known at the transmitter. We 
derive the optimum power allocation for the coding layers, which maximizes the total average rate. 



j> ■ I. Introduction 



The notion of information theoretic secrecy in communication systems was first introduced by Shannon in [1]. The information 
theoretic secrecy requires that the received signal of the eavesdropper does not provide any information about the transmitted 
. messages. Shannon considered a pessimistic situation where both the intended receiver and the eavesdropper have direct access 
| to the transmitted signal (which is called ciphertext). Under these circumstances, he proved a negative result showing that perfect 
I . secrecy can be achieved only when the entropy of the secret key is greater than, or equal to, the entropy of the message. In 
Q\ j modern cryptography, all practical cryptosystems are based on Shannnon's pessimistic assumption. Due to practical constraints, 
. secret keys are much shorter than messages; therefore, these practical cryptosystems are theoretically susceptible to breaking by 
^ | attackers. The goal of designing such practical ciphers, however, is to guarantee that no efficient algorithm exists for breaking 
■ ■ *h , them. 

Wyner in [2] showed that the above negative result is a consequence of Shannon's restrictive assumption that the adversary 
£2 ■ has access to precisely the same information as the legitimate receiver. Wyner considered a scenario in which a wire-tapper 
receives the transmitted signal over a degraded channel with respect to the legitimate receiver's channel. He further assumed 
that the wire-tapper has no computational limitations and knows the codebook used by the transmitter. He measured the 
level of ignorance at the eavesdropper by its equivocation and characterized the capacity-equivocation region. Interestingly, a 
non-negative perfect secrecy capacity is always achievable for this scenario. 

The secrecy capacity for the Gaussian wire-tap channel is characterized by Leung-Yan-Cheong in [3], Wyner's work is 
then extended to the general (non-degraded) broadcast channel with confidential messages by Csiszar and Korner [4]. They 
considered transmitting confidential information to the legitimate receiver while transmitting common information to both the 
legitimate receiver and the wire-tapper. They established a capacity-equivocation region for this channel. The BCC has recently 
been further studied in [5]-[7], where the source node transmits a common message to both receivers, along with two additional 
confidential messages, each aimed at one of the two receivers. Here, the confidentiality of each message is measured with 
respect to the other user, and there is no external eavesdropper. 

The fading wire-tap channel is investigated in [8] where the source-to destination channel and the source-to-eavesdropper 
channel are corrupted by multiplicative fading gain coefficients, in addition to additive white Gaussian noise. In this work, 
channels are fast fading and the Channel State Information (CSI) of the legitimate receiver is available at the transmitter. The 
perfect secrecy capacity is derived for two different scenarios regarding the availability of the eavesdropper's CSI. Moreover, 
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the optimal power control policy is obtained for the different scenarios. The effect of the slowly fading channel on the secrecy 
capacity of a conventional wire-tap channel is studied in [9], [10]. In these works, it is assumed that the fading is quasi-static 
and the transmitter does not know the fading gains. The outage probability, which is the probability that the main channel is 
stronger than the eavesdropper's channel, is defined in these works. In an outage strategy, the transmission rate is fixed and 
the information is detected when the instantaneous main channel is stronger than the instantaneous eavesdropper's channel; 
otherwise, either nothing is decoded at the legitimate receiver, or the information is leaked to the eavesdropper. The term 
outage capacity refers to the maximum achievable average rate. In [11], a broadcast strategy for the slowly fading Gaussian 
point to point channel is introduced. In this strategy, the transmitter uses a layered coding scheme and the receiver is viewed 
as a continuum of ordered users. 

In [15], the wire-tap channel is extended to the parallel broadcast channels and also to the fading channels with multiple 
receivers. In [15], the secrecy constraint is a perfect equivocation for each of the messages, even if all the other messages are 
revealed to the eavesdropper. The secrecy sum capacity for a reverse broadcast channel is derived subject to this restrictive 
assumption. The notion of the wire-tap channel is also extended to multiple access channels [16]— [19], relay channels [20]-[23], 
parallel channels [24] and Multiple-Input Multiple-Output channels [25]— [31]. Some other related works on the communication 
of confidential messages can be found in [32]-[36]. 

In this paper, we consider a scenario where a source node wishes to broadcast two confidential messages to two receivers, 
while a wire-tapper also receives the transmitted signal. This model is motivated by wireless communications, where individual 
secure messages are broadcast over shared media and can be received by any illegitimate receiver. In fact, we simplify the 
restrictive constraint imposed in [15] and assume that the eavesdropper does not have access to the other messages. We first 
study the general broadcast channel with an eavesdropper. We present an achievable rate region for this channel. Our achievable 
coding scheme is based on a combination of random binning and the Gelfand-Pinsker binning [37]. This scheme matches the 
Marton's inner bound [38] on the broadcast channel without confidentiality constraint. We further study the situation where the 
channels are physically degraded and characterize the corresponding secrecy capacity region. Our achievable coding scheme is 
based on Cover's superposition coding [39] and random binning. We refer to this scheme as the Secret Superposition Coding. 
This capacity region matches the capacity region of the degraded broadcast channel without any security constraint. It also 
matches the secrecy capacity of the wire-tap channel. We also characterize the secrecy capacity region when the channels are 
additive white Gaussian noise. We show that the secret superposition of Gaussian codebooks is the optimal choice. Based on 
the rate characterization of the secure broadcast channel, we then use broadcast strategy for the slow fading wire-tap channel 
when only the eavesdropper's channel is fixed and known at the transmitter. In broadcast strategy, a source node sends secure 
layers of coding and the receiver is viewed as a continuum of ordered users. We derive optimum power allocation for the 
layers which maximizes the total average rate. 

In [40], we published a conference version of this work where the achievable rate region of the general broadcast channel 
with an eavesdropper and the secrecy capacity region of the degraded one were addressed. However, we later became aware 
that reference [41], [42] had considered a similar model as used in this paper and had independently characterized the secrecy 
capacity region of the broadcast channel (when the channels are degraded). They also generalized their results to the parallel 
degraded broadcast channel with an eavesdropper. Independently and parallel to our work, reference [43] considered the 
Gaussian broadcast channel with an eavesdropper and characterized its capacity region. Authors of [43] provided two methods 
for their converse proof. The first one uses the alternative representation of the mutual information as an integration of the 
minimum-mean-square-error (MMSE), as well as the properties of the MMSE. The second one uses the relationship between 
the differential entropy and the Fisher information via the de Bruin identity, along with the properties of the Fisher information. 
In this work, however, we use Costa's entropy power inequality to provide the converse proof. 

The rest of the paper is organized as follows: in section II we introduce the system model. In section III we provide an 
inner bound on the secrecy capacity region when the channels are not degraded. In section IV we specialize our channel to the 
degraded ones and establish the secrecy capacity region. In section V we derive the secrecy capacity region when the channels 
are AWGN. Based on the secrecy capacity region of the AWGN channel, in section VI we use a broadcast strategy for the 
slow fading wire-tap channel when the transmitter only knows the eavesdropper's channel. Finally, section VII concludes the 
paper. 

II. Preliminaries 

In this paper, random variables are denoted by capital letters (e.g. X) and their realizations are denoted by corresponding 
lower case letters (e.g. x). The finite alphabet of a random variable is denoted by a script letter (e.g. X) and its probability 
distribution is denoted by P(x). The vectors will be written as x n = (xi, X2, X n ), where subscripted letters denote the 
components and superscripted letters denote the vector. Bold capital letters represent matrices (e.g. A). The notation x 1 " 1 
denotes the vector (xi,X2, ...,Xi-i) and the notation x % denotes the vector (xi,Xi + i, ...,x n ). A similar notation will be used 
for random variables and random vectors. 

Consider a Broadcast Channel with an eavesdropper (BCE) as depicted in Fig. [T] In this confidential setting, the transmitter 
wishes to send two independent messages (WijWjj) to the respective receivers in n uses of the channel and prevent the 



3 



(W h W 2 ) 



Encoder 



X" 



P{y u y 2 ,z\x) 



Fig. 1. Broadcast Channel with an Eavesdropper 
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eavesdropper from having any information about the messages. A discrete memoryless broadcast channel with an eavesdropper 
is represented by (X, P, J^i, y 2 , Z) where, X is the finite input alphabet set, 3^i, 3^2 and Z are three finite output alphabet sets, 
and P is the channel transition probability P(yi,y 2 , z\x). The input of the channel is x n £ X n and the outputs are y" £ y™, 
y 2 £ y 2 , and z n £ Z n for Receiver 1, Receiver 2, and the eavesdropper, respectively. The channel is discrete memoryless in 
the sense that 



P(y?, 2/ 2 ™, z n \x n ) = [] p (yi,i>m,i> z *\ x i)- 
i=i 

A ((2 nRl ,2 nIt2 ),n) code for a broadcast channel with an eavesdropper consists of a stochastic encoder 

/:({l,2,...,2 nfll }x{l,2,...,2 nfla })->^ n ) 

and two decoders, 

fl i:^-{l,2,...,2 nfl1 } 

and 

g 2 :y2^{l,2,...,2 nR >}. 



(1) 

(2) 
(3) 
(4) 



The average probability of error is defined as the probability that the decoded messages are not equal to the transmitted 
messages; that is, 

Pi") = P( 9 i(17) + Wx U g 2 (Y?) W 2 ). (5) 
The knowledge that the eavesdropper can extract about W\ and W 2 from its received signal Z n is measured by 

I(Z n ,W 1 ) = H(W 1 )-H(W 1 \Z n ), (6) 

I(Z n ,W 2 ) = H(W 2 )-H(W 2 \Z n ), (7) 

and 

I(Z n ,{W u W 2 )) = H{W!,W 2 ) - H{Wi,W 2 \Z n ). (8) 

Perfect secrecy revolves around the idea that the eavesdropper should not obtain any information about the transmitted messages. 
Perfect secrecy thus requires that 

I(Z n ,W 1 ) = O^H(W 1 ) = H(W 1 \Z n ), 
I(Z n , W 2 ) = & H(W 2 ) = H(W 2 \Z n ), 

and 

I{Z n , (Wi, W 2 )) = & H{W U W 2 ) = H(W U W 2 \Z n ). 

where n — > oo. The secrecy levels of confidential messages W\ and W 2 are measured at the eavesdropper in terms of 
equivocation rates which are defined as follows: 

Definition 1 The equivocation rates R e \, R e2 and R e \ 2 for the broadcast channel with an eavesdropper are: 

Pel = -H{W!\Z n ), 

n 

R e2 = -H(W 2 \Z n ), 
n 

R el2 = -H(W u W 2 \Z n ). 

n 
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The perfect secrecy rates R\ and R 2 are the amount of information that can be sent to the legitimate receivers in a reliable 
and confidential manner. 

Definition 2 A secrecy rate pair (Ri,R 2 ) is said to be achievable if for any e > 0, e\ > 0, e 2 > 0,63 > 0, there exists a 
sequence of ((2 nRl ,2 nR2 ),n) codes, such that for sufficiently large n, we have: 



In the above definition, the first condition concerns the reliability, while the other conditions guarantee perfect secrecy for each 
individual message and the combination of the two messages, respectively. Since the messages are independent of each other, 
the conditions of ( fTOb and ([T2l > or (fTTT l and ( TT2l > are sufficient to provide perfect secrecy. The capacity region is defined as 
follows. 

Definition 3 The capacity region of the broadcast channel with an eavesdropper is the closure of the set of all achievable 
rate pairs (Rx,R 2 ). 



In this section, we consider the general broadcast channel with an eavesdropper and present an achievable rate region. Our 
achievable coding scheme is based on a combination of the random binning, superposition coding, rate splitting, and Gelfand- 
Pinsker binning schemes [37]. Our binning approach is supplemented with superposition coding to accommodate the common 
message. We call this scheme the Secret Superposition Scheme. An additional binning is introduced for the confidentiality of 
private messages. We note that these double binning techniques have been used by various authors for secret communication 
(see e.g. [5], [7]). The following theorem illustrates the achievable rate region for this channel. 

Theorem 1 Let Rj denote the union of all non-negative rate pairs (Rq, Ri, R 2 ) satisfying 



R < mm{I(U; Y x ), I(U; Y 2 )} - /([/; Z), 
Ri + Ro< HViiY^U) - I(V i; Z\U) + mm{I{U; Y t ), I(U; Y 2 )} - I(U; Z), 
R2 + Ro< I(V 2 ;Y 2 \U) ~ I(V 2 ; Z\U) + mm{I(U; Fi),/(?7; Y 2 )} - I(U; Z), 
R 1 + R2 + R <I{Vr,Y 1 \U) + I{V 2 )Y 2 \U)-I{V u V2]Z\U)-I{Vv,V2\U) + 



over all joint distributions P{u)P(v\, v 2 \u)P(x\vi, v 2 )P{yi, y 2l z\x). Any rate pair (i?o, Ri, R2) G is then achievable for 
the broadcast channel with an eavesdropper and with common information. 

Please see Appendix Q] for the proof. 

Remark 1 // we remove the secrecy constraints by removing the eavesdropper, the above rate region becomes Marton's 
achievable region with common information for the general broadcast channel. 

Remark 2 If we remove one of the users, e.g. user 2 and the common message, then we get Csiszar and Korner's secrecy 
capacity for the other user. 

IV. The Capacity Region of the Degraded BCE 
In this section, we consider the degraded broadcast channel with an eavesdropper and establish its secrecy capacity region. 

Definition 4 A broadcast channel with an eavesdropper is said to be physically degraded, if X — > Y\ — » Y 2 — » Z forms a 
Markov chain. In other words, we have 




(9) 
(10) 

(11) 
(12) 



Rel> Ri — £1) 
Re2> R-2 — £2, 
Rel2> Rl + R 2 — £3- 



III. Achievable Rates for General BCE 



P(yi,V2,z\x) = P(yi\x)P(y 2 \ yi )P(z\y 2 ). 



Definition 5 A broadcast channel with an eavesdropper is said to be stochastically degraded if its conditional marginal 
distributions are the same as that of a physically degraded broadcast channel, i.e., if there exist two distributions P (y 2 \yi) 
and P [z I y 2 ), such that 
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p(y2\x)=J2 p (y^)p'(y2\vi), 



P{z\x)=Y J P{y2\x)P\z\y 2 ). 



yi 



Lemma 1 The secrecy capacity region of a broadcast channel with an eavesdropper depends only on the conditional marginal 
distributions P(yi\x), P(y 2 \x) and P(z\x). 

Proof: It suffices to show that the error probability of P e (n) and the equivocations of H(Wi\Z n ), H(W 2 \Z n ) and 
H(Wi, W 2 \Z n ) are only functions of the marginal distributions when we use the same codebook and encoding schemes. Note 
that 



In the following theorem, we fully characterize the capacity region of the physically degraded broadcast channel with an 
eavesdropper. 

Theorem 2 The capacity region for transmitting independent secret information over the degraded broadcast channel is the 
convex hull of the closure of all [Ri^Ri) satisfying 



for some joint distribution P(u)P(x\u)P(yi,y 2 , z\x). 
Please refer to Appendix [TT] for the proof. 

Remark 3 If we remove the secrecy constraints by removing the eavesdropper, then the above theorem becomes the capacity 
region of the degraded broadcast channel. 

The coding scheme is based on Cover's superposition coding and random binning. We refer to this scheme as the Secure 
Superposition Coding scheme. The available resources at the encoder are used for two purposes: to confuse the eavesdropper 
so that perfect secrecy can be achieved for both layers, and to transmit the messages into the main channels. To satisfy 
confidentiality, the randomization used in the first layer is fully exploited in the second layer. This makes an increase of 
I(U; Z) in the bound of Ri. 

Remark 4 As Lemma [2] bounds the secrecy rates for the general broadcast channel with an eavesdropper then, Theorem [2] is 
true when only the legitimate receivers are degraded. 



In this section, we consider the Gaussian Broadcast Channel with an Eavesdropper (G-BCE). Note that optimizing ( fT3] l and 
(fl4] i for AWGN channels involves solving a nonconvex functional. Usually nontrivial techniques and strong inequalities are used 
to solve the optimization problems of this type. In [3], Leung-Yan-Cheong successfully evaluated the capacity expression of the 
wire-tap channel by using the entropy power inequality [44], [45]. Alternatively, it can also be evaluated using a classical result 
from the Estimation Theory and the relationship between mutual information and minimum mean-squared error estimation. 
On the other hand, the entropy power inequality is sufficient to establish the converse proof of a Gaussian broadcast channel 
without secrecy constraint. Unfortunately, the traditional entropy power inequality does not extend to the secure multi-user 
case. Here, by using Costa's version of the entropy power inequality, we show that secret superposition coding with Gaussian 
codebook is optimal. 

Figure |2] shows the channel model. At time i the received signals are Yu = Xi + Nu, Y 2 i = Xi + N 2 i and Zi = Xj + N^i, 
where Nji is a Gaussian random variable with zero mean and Var(Nji) — er 2 for j = 1,2,3. Here u\ < u\ < <r|. Assume 
that the transmitted power is limited to E^X 2 ] < P. Since the channels are degraded, the received signals can alternatively be 
written as Y u = X { + N u , Y 2i = Y u + N 2i and Z { = Y 2i + N' 3i , where Nu's are i.i.d 7V(0, a\), iV^'s are i.i.d A/"(0, o\ - a\), 
and A^'s are i.i.d A^(0, ct| — er 2 ). Fig. [3] shows the equivalent channels for the G-BCE. The following theorem illustrates the 
secrecy capacity region of G-BCE. 




i?i< I{X;Yx\U)-I{X;Z\U), 
R 2 <I(U;Y 2 )-I{U;Z), 



(13) 
(14) 



V. Capacity Region of Gaussian BCE 
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Fig. 2. Gaussian Broadcast Channel with an Eavesdropper(G-BCE) 
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Theorem 3 The secrecy capacity region of the G-BCE is given by the set of rates pairs (i?i,i?2) satisfying 

H<c(!g)-c(%\ 



for some a £ [0, 1]. 



aP + a% 



aP + cri 



(15) 
(16) 



Please see Appendix [TTT] for the proof. 

Figure [4] shows the capacity region of a degraded Gaussian broadcast channel with and without secrecy constraint. In this 
figure P = 20, Ni = 0.9, N 2 = 1.5 and N 3 = 4. 

VI. A Multilevel Coding Approach to the Slowly Fading Wire-Tap Channel 

In this section, we use the secure degraded broadcast channel from the previous section to develop a new broadcast strategy 
for a slow fading wire-tap channel. This strategy aims to maximize the average achievable rate where the main channel 




R2 (bit/sec) 

Fig. 4. Secret/Non-Secret Capacity Region of a Degraded Broadcast Channel 
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Fig. 6. Equivalent Broadcast Channel Model. 



state information is not available at the transmitter. By assuming that there is an infinite number of ordered receivers which 
correspond to different channel realizations, we propose a secret multilevel coding scheme that maximizes the underlying 
objective function. First, some preliminaries and definitions are given, and then the proposed multilevel coding scheme is 
described. Here, we follow the steps of the broadcast strategy for the slowly fading point-to-point channel of [11]. This method 
is used in several other papers; see, e.g, [12]— [14]. 

A. Channel Model 

Consider a wire-tap channel as depicted in Fig|5] The transmitter wishes to communicate with the destination in the presence 
of an eavesdropper. At time i, the signal received by the destination and the eavesdropper are given as follows 

Yi = h M Xi + N u (17) 
Zi = HeX, + N 2i 

where Xi is the transmitted symbol and Km, h>E are the fading coefficients from the source to the legitimate receiver and to 
the eavesdropper, respectively. The fading power gains of the main and eavesdropper channels are given by s = |/im| 2 and 
s = \Iie\ 2 , respectively. Nu, N 2 i are the additive noise samples, which are Gaussian i.i.d with zero mean and unit variance. 
We assume that the channels are slowly fading, and also assume that the transmitter knows only channel state information of 
the eavesdropper channel. A motivation for this assumption is that when both channels are unknown at the transmitter, we 
assume that s = \1ie\ 2 denotes the best-case eavesdropper channel gain. For each realization of Jim there is an achievable 
rate. Since the transmitter has no information about the main channel and the channels are slowly fading, then the system 
is non-ergodic. Here, we are interested in the average rate for various independent transmission blocks. The average shall be 
calculated over the distribution of Km- 

B. The Secret Multilevel Coding Approach 

An equivalent broadcast channel for our channel is depicted in Fig. [6] wherein the transmitter sends an infinite number of 



s 



secure layers of coded information. The receiver is equivalent to a continuum of ordered users. For each channel realization 
h\j with the fading power gain s k , the information rate is R(s k ,s ). We drop the superscript k, and the realization of the 
fading power random variable S is denoted by s. Therefore, the transmitter views the main channel as a secure degraded 
Gaussian broadcast channel with an infinite number of receivers. The result of the previous section for the two receivers can 
easily be extended to an arbitrary number of users. According to theorem [3] the incremental differential secure rate is then 
given by 



dR(s, s ) 



sp(s)ds 
1 + sl(s) 



I log ( 1 



s p(s)ds 
1 + s'I{s) 



(18) 



where p(s)ds is the transmit power of a layer parameterized by s, intended for receiver s. As log(l + x) « x for x < 1 then 
the log function may be discarded. The function I(s) represents the interference noise of the receivers indexed by u > s which 
cannot be canceled at receiver s. The interference at receiver s is therefore given by 



I( S ) 



p(u)d(u) 



The total transmitted power is the summation of the power assigned to the layers 



1(0) 



p(u)d(u) 



(19) 



(20) 



The total achievable rate for a fading realization s is an integration of the incremental rates over all receivers, which can 
successfully decode the respective layer 



R(s,s') = l 



up[u)du s p(u)du 



l + ul(u) l + s'I(u) 



(21) 



Our goal is to maximize the total average rate over all fading realizations with respect to the power distribution p(s) (or 
equivalently, with respect to I(u), u > 0) under the power constraint of [20] The optimization problem may be written as 



-Rmax= max 

I(u) 

s.t 



R{u, s )f(u)du, 



(22) 



P= HO) = f P(u)d(u), 
Jo 



where f(u) is the probability distribution function (pdf) of the power gain S. Noting that the cumulative distribution function 
(cdf) is F(u) = j Q l f(a)da, the optimization problem may be written as 



1 

— max 

2 i(u) 



s.t 



where G(u) 



!+«/(«) l + s'/(«) 



(1 - F(u))G{u)du, 

P= 1(0) = / p(u)d(u), 
Jo 

—I (u). Therefore, the functional in 



(23) 



p(u). Note that p(u 

J(x,I(x) } l'(x)) = 
-(l-F(x)) 



may be written as 



l + xl(x) l + s'I(x) 
The necessary condition for the maximization of an integral of J over x is 



I(x). 



(24) 



(25) 



where Jj means the derivation of function J with respect to /, and similarly Jj> is the derivation of J with respect to / . 
After some manipulations, the optimum I(x) is given by 



l-F(x)-(x-s )f(x) 
I(x) = < s'(l-F(x))+x(x-s')f(x)- 

o, 



max{s ,xq} < x < x\\ 
otherwise, 



where xq is determined by I(xq) = P, and X\ by I(xi) 
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As a special case, consider the Rayleigh flat fading channel. The random variable S is exponentially distributed with 

/(.s)=e" s , F{s) = l-e~ s , s > 0. (26) 

Substituting /(s) and F(s) into the optimum I(s) and taking the derivative with respect to the fading power s yields the 
following optimum transmitter power policy 



rl ( -s 2 +2(s'+l)s-s' 2 r ' 1 ^ ^ 

ds { 0, otherwise, 



P(s) = 

where so is the solution of the equation I(sq) = P, which is 

-1 + Ps + ^P 2 s" 2 + 2P(1 - 2P)s + 4P + 1 

50 " 2P ■ 

and Si is determined by I(si) = 0, which is 

si = 1 + s. 



VII. Conclusion 

A generalization of the wire-tap channel in the case of two receivers and one eavesdropper was considered. We established an 
inner bound for the general (non-degraded) case. This bound matches Marton's bound on broadcast channels without security 
constraint. Furthermore, we considered the scenario in which the channels are degraded. We established the perfect secrecy 
capacity region for this case. The achievability coding scheme is a secret superposition scheme where randomization in the 
first layer helps the secrecy of the second layer. The converse proof combines the converse proof for the degraded broadcast 
channel without security constraint, and the perfect secrecy constraint. We proved that the secret superposition scheme with the 
Gaussian codebook is optimal in AWGN-BCE. The converse proof is based on Costa's entropy power inequality and Csiszar 
lemma. Based on the rate characterization of the AWGN-BCE, the broadcast strategy for the slowly fading wire-tap channel 
were used. In this strategy, the transmitter only knows the eavesdropper's channel and the source node sends secure layered 
coding. The receiver is viewed as a continuum of ordered users. We derived the optimum power allocation for the layers, 
which maximizes the total average rate. 



Appendix I 
Proof of TheoremQ] 

We split the private message W x E {1, 2, 2 nRl } into W n E {1, 2, 2 nRl1 } and W 10 E {1, 2, 2 nRl0 }, and W 2 E 
{1,2, ...,2 nR2 } into W 22 E {l,2,...,2 ni?22 } and W 20 E {1, 2, 2 nR ™}, respectively. W n and W 22 are only to be decoded 
by the intended receivers, while Wio and W20 are to be decoded by both receivers. Now, we combine (VF10, W20, Wq) into a 
single auxiliary variable U. The messages Wn and W 22 are represented by auxiliary variables V\ and V2, respectively. Here, 
Rio + itii = i?i and R20 + R22 = 

1) Codebook Generation: The structure of the encoder is depicted in FigJT] Fix P(u), P(vi\u), P(v2\u) and P(x\v\,V2). 
The stochastic encoding is as follows. Define 

L n = I(Vi,Yi\U) - I(Vi;Z,V 2 \U), 
L l 2=I{V 1 ;Z\V2,U), 
L 21 =I(V 2 ;Z\V 1 ,U) 
L 22 = I(V 2 ;Y 2 \U) - I{V 2 ;Z,Vx\U), 
L 3 = I(V i; V 2 \U) - e, 

Note that, 

L u + Lia + L 3 =I(yi;Y 1 \U)-e, 
L22 + L 2 i + L 3 = I(V2;Y 2 \U) - e, 

Rn> in > 0, (27) 
i? 22 > L 22 > 0. (28) 

Generate 2 n( - Rl0+R20+R ^ independent and identically distributed (i.i.d) sequences u n (k) with k E {1, 2, 2 Rl0+R20+R °J, ac- 
cording to the distribution P(u n ) = Y[7=i P{ u i)- F° r each codeword u n (k), generate 2 Lll+Ll2+L3 i.i.d codewords i ,i ), 
with i E {l,2,...,2 ,lLl1 }, 1 E {l,2,...,2 nil2 } and i" E {1, 2, 2 nL3 }, according to P(v?\u n ) = U7=i P ( v u\ui)- The 
indexing presents an alternative interpretation of binning. Randomly distribute these sequences of w"into 2 nLl1 bins indexed 



We first prove the case where 



10 




by i, for the codewords in each bin, randomly distribute them into 2 nLl2 sub-bins indexed by i ; thus i is the index for the 
codeword in each sub-bin. Similarly, for each codeword u n , generate 2 L21+L22+L3 i.i.d codewords v^ij^j ,j ) according to 
P(v2\u n ) = H.Li p (vM, where j g {1, 2, .., 2 nL ^}, j g {1,2, ...,2 nL ™} and j" g {1, 2, 2" L ^}. 

2) Encoding: To send messages (iuio, W20, Wo), we calculate the corresponding message index k and choose the corresponding 
codeword u n (k). Given this u n (k), there exists 2 n ( Lll+Ll2+L3 ^ codewords of v%(i,i ,i ) to choose from for representing 
message wn. Evenly map 2 11 messages W\\ to 2 nLl1 bins, then, given d27b . each bin corresponds to at least one message 
wn. Thus, given W\±, the bin index i can be decided. 

1) If-Rn < in + L12, each bin corresponds to 2™ (iJll_Lll) messages w n . Evenly place the 2 nLl2 sub-bins into 2" (i?11_ill) 
cells. For each given wn, we can find the corresponding cell,then, we randomly choose a sub-bin from that cell, thus 
the sub-bin index i can be decided. The codeword (i, i A ) will be chosen properly from that sub-bin. 

2) If Ln + L12 < Rn < L\\ + L12 + L3, then each sub-bin is mapped to at least one message wn, therefore, given 
mn, i cab be decided. In each sub-bin, there are 2 n ( Rll ~ Lll ~ Ll2 " > messages. The codeword u™ A ) will be chosen 
randomly and properly from that sub-bin. 

Given W22, we select v^ij,] ,j ) in the exact same manner. From the given sub-bins , the encoder chooses the codeword 
pair (vi(i,i ,i ),v% ,j )) that satisfies the following property, 

« (*,*',*"), v2 (j, j',j"))eAW(V u V 2 ,U) 

where Ae(U, V\, V2) denotes the set of jointly typical sequences u n , u™, and with respect to P{u, v\, V2). If there is more 
than one such pair, the transmitter randomly chooses one; if there is no such pair, an error is declared. 

Given w" and wj, the channel input x n is generated i.i.d. according to the distribution P(x n \v" , v% ) = n™=i P( x i\ v ih v 2i)- 

3) Decoding: The received signals at the legitimate receivers, yf and y^, are the outputs of the channels P(y r {\x n ) — 
nr=i P(yi,i\ x i) an< ^ P(V2 \ xU ) — n"=i P{V2,i\ x i)' respectively. The first receiver looks for the unique sequence u n (k) such 
that 

(« n (A),W?)G^(^n). 

If such u n {k) exists and is unique, set k — k; otherwise, declare an error. Upon decoding k, this receiver looks for sequences 
Vi(i, i , i ) such that 

«(M ,i"),u n (k),y?) g A^{V X ,U,Y X ). 

If such u™ (i, i , i ) exists and is unique, set i = i, i = i , and i = i ; otherwise, declare an error. Using the values of k, i, i 
and i , the decoder can calculate the message indices wq, t&io an d u>n- The decoding for the second decoder is similar. 

4) Error Probability Analysis: Since the region of Rj is a subset of the Marton's region, then the error probability analysis 
is the same as [38]. 
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5) Equivocation Calculation: To meet the secrecy requirements, we need to prove that the common message Wo, the 
combination of (Wb,Wi), the combination of (Wo, W2), and the combination of (Wo,Wi, W2) are perfectly secured. The 
proof of secrecy requirement for the message Wo is straightforward and is therefore omitted. 

To prove the secrecy requirement for (Wo, Wi), we have 

nR el0 = H(Wx,W \Z n ) 

= H{W u W^Z n )-R(Z n ) 

= H(Wi,W , U n , Vl\ Z n ) - H(U n , V?\Wi,W , Z n ) - H(Z n ) 

= H(Wi,W , U n , VI 1 ) + H{Z n \W u Wo, U n , V?) - H{U n \W u W , Z n ) - H{V?\Wi,W Q , Z n , U n ) - H{Z n ) 

> H(W 1: Wo, U n , YD + H(Z n \W u Wo, U n , V?) - ne n - H(Z n ) 



H(W U Wo, U n , V") + H(Z n \U n , V™) - ne n - H(Z n ) 



> H{U n , F x ") + H(Z n \U n , V?) - ne n - H{Z n ) 
= H(U n ) + H(V?\U n ) - I(U n , V?; Z n ) - ne n 

> mm{I(U n ; Y?),I(U n ; Y 2 n )} + I(Vf; Y?\U n ) - I(V{ 1 ; Z n \U n ) - I(U n ; Z n ) - ne n 

> nRi + nRo — ne n , 

where (a) follows from Fano's inequality that bounds the term H(TJ n \W\, Wo, Z n ) < h(P^ ) + nP™ e0 R w0 < ne n /2 and 
the term H{V^\W U W , Z n , U n ) < h{P^ e \) + nP™ el R wl < ne n /2 for sufficiently large n. Here P" e0 and P" el denotes the 
wiretapper's error probability of decoding u n and V™ in the case that the bin numbers wq and wi are known to the eavesdropper, 
respectively. The eavesdropper first looks for the unique u n in bin wq of the first layer, such that it is jointly typical with z n . 
As the number of candidate codewords is small enough, the probability of error is arbitrarily small for a sufficiently large n. 
Next, given u n , the eavesdropper looks for the unique in the bin w\ which is jointly typical with z n . Similarly, since the 
number of available candidates is small enough, then the probability of decoding error is arbitrarily small, (b) follows from the 
fact that (Wi, W ) -> U n -> V" -> Z n forms a Markov chain. Therefore, we have I{Wi, W ;Z n \U n , V?) = 0, where it is 
implied that H(Z n \Wi,Wo, U n , V") = H(Z n \U n ,V{ 1 ). (c) follows from the fact that H(Wi, W , U n , X n ) > H{U n ,X n ). 
(d) follows from that fact that H(U n ) > mm{I(U n ; Yf), I(U n ; Y 2 n )} and H(V{ L \U n ) > I(V?;Y{ l \U n ). (e) follows from 
Lemma 2] of the appendix II VI 

By using the same approach it is easy to show that, 

nR e2Q = H(W 2 ,W \Z n ) 

> nR 2 + nRo — ne n - 
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Therefore, we only need to prove that (Wq, W\, W 2 ) is perfectly secured; we have 

nR el20 = H{Wi,W 2 ,W \Z n ) 

= H(W 1 ,W 2 ,W ,Z n )-H(Z n ) 

= H(W 1: W 2 ,W , U n , Vr, V 2 n , Z n ) - H(U n , V?, V 2 n \W 1: W 2 , W , , Z n ) - H{Z n ) 

= H(W 1: W 2 ,W , U n , Vr, V 2 n ) + H(Z n \Wi, W 2 , W 0> U n , Vf, V 2 n ) - H(U n , V?, V 2 n \W u W 2 , W , Z n ) 

- H{Z n ) 

> H(W lt W 2 ,W , U n , Vf, V 2 n ) + E{Z n \W u W 2 , W , U n , V?, V 2 n ) - ne n - H(Z n ) 



(b) 



H(W 1} W 2 ,W , U n , V?, V 2 n ) + H(Z n \U n , V™, V 2 n ) - ne n - H(Z n ) 



> H(U n , V?, V 2 ) + H(Z n \U n , V™, V 2 n ) - ne n - H(Z r ' 



(d) 



H(U n ) + H(V{ l \U n ) + H(V 2 n \U n ) - I(V?; V 2 n \U n ) + H(Z n \U n , V? , V 2 n ) - ne n - H(Z n ) 



> mm{I(U n ; F"), I(U n ; Y 2 n )} + I{V?; Y?\U n ) + I(V 2 n ; Y 2 n \U n ) - I{V?; V 2 n \U n ) - I(U n , V™, V 2 n ; Z n ) 

- ne n 
(/) 

> min{7(C7 n ; I(U n ; Y 2 n )} + I(Vf; Y£\U n ) + I(V 2 ; Y 2 n \U n ) - I(V?; V 2 n \U n ) - I(V", V 2 n ; Z n \U n ) 

- I(U n ;Z n )~ne n 

(s) 

> nmm{I(U; Yi), 1(17; Y 2 )} + nI(Vi;Y 1 \U) + nI(V 2 ;Y 2 \U) - nI(V r , V 2 \U) - nI{V x , V 2 ; Z\U) 

- nI(U; Z) — ne n 

> nRi + nR 2 + uRq — ne n , 

where (a) follows from Fano's inequality, which states that for sufficiently large n, H(U n , Vf 1 , V 2 n \Wi, W 2 , Wq, Z n ) < 
h{p!^e) + n Pwe-Rw < ne n . Here P™ e denotes the wiretapper's error probability of decoding (u n ,v",v 2 ) in the case that 
the bin numbers wq, w\, and w 2 are known to the eavesdropper. Since the sum rate is small enough, then P" e — * 
for sufficiently large n. (b) follows from the following Markov chain: (Wi,W2,Wq) — > (U n ,V{ l ,V 2 n ) — ► Z n . Hence, we 
have H(Z n \W 1 ,W 2 ,W , U n , V{\ V 2 n ) = H(Z n \U n , V[ l , V 2 n ). (c) follows from the fact that H(W 1 ,W 2 , W , U n , V{\ V 2 n ) > 
H(U n , Vf, V 2 n ). (d) follows from that fact that H(U n , Vf, V 2 n ) = H(U n ) + H(V?\U n ) + H{V 2 n \U n ) - I{V{ L ; V 2 n \U n ). (e) 
follows from the fact that H{U n ) > mm{I(U n ;Y{ 1 ), I(U n ;Y 2 n )} and H(V?\U n ) > I{V t n ; Y[ l \U n ) for i = 1,2. (/) follows 
from the fact that I(U n , V{ 1 , V 2 n ; Z n ) = I(U n ;Z n ) + 7(V r 1 ™, V 2 n ; Z n \U n ). (g) follows from Lemma and Lemma in the 
appendix II VI This completes the achievability proof. 

Appendix II 
Proof of Theorem[2] 

Achievablity: We need to show that the region of ( fT3l and (fl4] i is a subset of the achievability region of Theorem Q] In the 
achievability scheme of Theorem Q] if we set W 2 = and rename Wq with W 2 , then using the degradedness, we obtain the 
following region, 

Ri + R 2 < I(V; Y 1 \U) - I(V; Z\U) + I(U; Y 2 ) - I(U; Z), (29) 
R 2 <I{U;Y 2 )-I{U;Z). 

Note that since the first receiver decodes both messages, the total rate of this receiver is Rx <— R\ + R 2 and we have 

Ri< I(UV; Y X \U) + I(U; Y 2 ) ~ I(UV; Z), (30) 
R 2 <I(U;Y 2 )-I(U;Z). 

Now, since U—>V—>X—>Y 2 — > Z is a markov chain, then the following region is a subset of the above region, and 
consequently, it is achievable, 

Ri<I(X;Y 1 \U)+I(U;Z)-I(X;Z), (31) 
R 2 <I{U;Y 2 )~I{U;Z). 

which is the same as that of region ( fT3l and (TBI . This completes the achievability proof. 

Converse: The transmitter sends two independent secret messages Wi and W 2 to Receiver 1 and Receiver 2, respectively. Let 
us define Ui = (W 2 , Y^ 1 ). The following Lemma bounds the secrecy rates for a general case of (W\, W 2 ) — > X n — * Y{ l Y 2 n Z n : 



Lemma 2 For the broadcast channel with an eavesdropper, the perfect secrecy rates are bounded as follows, 

n 

ni?i< Y Y u\W 2 , Z u Yt\Z i+1 ) + nSx + ne 3 , 

i=l 

n 

nR 2 < Y I(W 2 ;Y 2l \Z l , Y*-\ + nS 1 + ne 2 . 

i=l 

Proof: We need to prove the second bound. The first bound can similarly be proven. nR 2 is bounded as follows: 

nR 2 < H{W 2 \Z n )+ne 2 

(b) 

< H(W 2 \Z n ) - H(W 2 \Y 2 n ) + ndt + ne 2 
= I(W 2 ;Y 2 n )-I(W 2 ;Z n ) + nS 1 +ne 2 

where (a) follows from the secrecy constraint that H(W 2 \Z n ) > H{W 2 ) — ne 2 . (b) follows from Fano's inequality that 
H(W 2 \Y 2 n ) < nS-t. Next, we expand I(W 2 ; Y 2 n ) and I(W 2 ; Z n ) as follows. 

n 

I{W 2 -Y 2 n ) = Ys^^Vt 1 ) 

i=l 

n 

= Y T (W 2 , Z i+1 ; Y 2i \Yt ^-I^-Y^W^Yt 1 ) 

i=l 

n 

= ]T ; Y 2i I Yt 1 ,Z i+1 ) + I(Z i+1 ; Y 2i \Yt 1 ) - I(Z t+1 ; Y 2l \ W 2 , Y 2 '~ 1 ) 

i=l 
n 

= ^/(^ 2; r 24 |rr 1 ,^+ 1 ) + A 1 -A 2 , 

where, Ai = £" =1 1{Z l+1 - Y^Y^ 1 ) and A 2 = J2?=i I(Z l+1 ;Y 2t \W 2 ,Y 2 t - 1 ). Similarly, we have, 

n 

I(W 2 ;Z n ) = ^/(^ 2 ;Z t |Z 4+1 ) 
i=i 

n 

= £ J (W 2 , yr 1 ; Z t I ) - I{Yt 1 ;Z i \W 2 ,Z i+1 ) 

n 

= Y I ( w 2\ z i\ Y 2 ~ X , Z t+1 ) + I{Y 2 l - 1 ;Z l \Z l+1 ) - I(Y 2 l - 1 ;Z l \W 2 ,Z l+1 ) 
i=i 

n 

= ^/(^ 2 ;Z l |yr 1 ,Z 4+1 ) + A* 1 -A*, 

i—l 

where, AJ = £™ =1 Z t \Z l+1 ) and A^ = £" =1 1 ( Y 2~ 1 ^ Z l \W 2l Z l+1 ). According to Lemma 7 of [4], A x = AJ and 

A 2 = A 2 . Thus, we have, 

n 

nR 2 < ^/(iy 2 ;y 24 |y 2 l - 1 ,Z 4 + 1 )-/(iy 2 ;Z 4 |y 2 t - 1 ,Z l+1 )+n ( 5 1 +ne 2 

i=l 

n 

= ]TtfGy 2 |z 4 ,yr\^ +1 ) - tf(ty 2 |y 2l ,yr\^ +1 ) + + ne 2 

i=l 
(a) ™ 

< ^ ff(W 2 |Z i; y 2 l - \ - H(W 2 \Y 2i ,Zi, Y 2 l -\ Z l+1 ) + nS 1 + ne 2 

i=l 
n 

= Y, I(W 2 ;Y 2t \Z u YZ-\Z* +1 ) + nS 1 + ne 2 , 

i=l 

where (a) follows from the fact that conditioning always decreases the entropy. ■ 
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Now according to the above Lemma, the secrecy rates are bounded as follows: 

(a) n 

nRi < IiWi;Y lti \W2,Zi, Y^ 1 , Z i+l ) + n5 l + ne 3 
i=i 

n 

= J2 HWi^ipi, Z it Z l+1 ) + n5 1 + ne 3 



i=l 



(6) 



i=l 



(d) 



I{Xi\ Y lti , Ui, Zi\Z i+1 ) - I(X t ; Z t \Z l+1 ) - I(X t ; Ui\Z h Z l+1 ) + n5 1 + ne 3 

n 

Y lti \Ui, Z l+1 ) + /(X; U t \Z l+1 ) - I(X l -Z t \Z l+1 ) - /(X; Ui\Z h Z l+l ) + nSi + ne 3 

i=l 
n 

J2 I(Xi', Yi,i\Ui, Z l+1 ) - /(X; Z t \Z l+1 ) + I{Zi- U t \Z* +1 ) - I{Zi- Ui\X h Z l+1 ) + nti + ne 3 

i=l 
n 

I(X l : l Y hi \Ui, Z l+1 ) - /(X; Z t \Z l+1 ) + I{Zi- U t \Z l+1 ) + n5 x + ne 3 , 



i=l 

where (a) follows from the Lemma (f2|. (bj^ follows from the data processing theorem, (c) follows from the ^hain rule, (d) 
follows from the fact that I(X t ;Y ht , Ui, Zi\Z i+l ) = /(X,; U l \Z i+1 ) + I(X l ;Y ht \U l ,Z i+1 ) + I(X l ; Zi\Y ht , U u Zj +1 ) and from 
the fact that Z i+1 U t — > X., — > Yi, 4 — > Y 2 ,i — > Zi forms a Markov chain, which means that /(X;; Z 4 |F M , Ui, Z l+1 ) = 0. (e) 
follows from the fact that /(X; Ui\Z l+1 ) - /(X,; Ui\Z h Z l+1 ) = I{Zf, U t \Z i+1 ) - I(Zf, U l \X ll Z i+1 ). (/) follows from the 
fact that Z i+1 Ui -> X t -> Z l forms a Markov chain. Thus, I{Z % ; U l Z l+1 \X i ) = which implies that I(Zi\ Ui\X h Z i+1 ) = 0. 
For the second receiver, we have 



(a) 

nR 2 < ^2l(W 2 ;Y 2 ^\Y 2 l - 1 ,Z l ,Z l+1 )+nS 2 + ne 1 

i=l 



J2 H(Y 2<i \Yt 1 ,Z i , Z i+1 ) - H{Y 2ji \W 2 , Yt\ Z t ,Z t+1 ) + nd 2 + ne 1 



(fa) 



< H ^A Z ^ Z l+1 ) - H(Y 2ti \W 2 , Y{-\ Y 2 l -\ Z it Z l+1 ) + nS 2 + nej 



(■0 



(<*) 



Y H(Y 2ti \Zi, Z i+1 ) - H{Y 2 ,i\Ui, Z h Z l+1 ) + nS 2 + ne 1 

i=l 
n 

1{Y2,i] Ui\Zi, Z l+1 ) + nS 2 + ne 1 

i=l 
n 

I{Y2,i\ U>\Z l+1 ) + I(Y 2ti ; Zi\Ui, Z i+1 ) - I(Y 2ji ; Zi\Z i+l ) + nS 2 + ne x 

i=l 
n 

Y I(Y Xl - U t \Z l+1 ) - I{Zi- U t \Z l+1 ) + I{Zi- Ui\Y 2>i , Z t+1 ) + nS 2 + ne x 

i=l 
n 

1(Y 2 ,i; U t \Z l+1 ) -I{Zi- U t \Z l+1 ) + nS 2 + nei, 



where (a) follows from the lemma (fJJ. (b) follows from the fact that conditioning always decreases the entropy, (c) follows 
from the fact that Y^ 1 -> W 2 Z i+1 Yl~ 1 -»■ Y 2i ^ Zi forms a Markov chain, (d) follows from the fact that Z l+1 U l -> 
y 2) z -> Zi forms a Markov chain. Thus I(Z % ; U l Z l+1 \Y 2i ) = which implies that I(Zi; U l \Y 2i , Z i+1 ) = 0. Now, following 
[39], let us define the time sharing random variable Q which is uniformly distributed over {1,2, ...,n} and independent of 

(W 1 ,W 2 ,X n ,Y 1 n ,Y 2 n ). Let us define U = U Q , V = (Z Q+ \Q), X = X Q , Y x = Y 1<Q , Y 2 - Y 2>Q , Z = Z Q , then Rt and 
R 2 can be written as 

Ri< I(X; Y ± \U, V) + I(U; Z\V) - I(X; Z\V), (32) 
R 2 <I(U;Y 2 \V)-I(U;Z\V). (33) 
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Note that the boundary of this region is characterized by the maximization of R\ + [iR 2 over this region for fx > 1. On the 
other hand we have, 

Ri + ^R2 < I(X; Y^U, V) + I{U: Z\V) - I{X; Z\V) + /* {I(U; Y 2 \V) - I(U; Z\V)) (34) 

Since conditional mutual information is the average of the unconditional ones, the largest region is achieved when V is a 
constant. This proves the converse part. 

Appendix III 
Proof of Theorem[3] 

Achievability: Let U ~ Af(0, (1 - a)P) and X ~ 7V(0, aP) be independent and X = U + X' ~ 7V(0, P). Now consider 
the following secure superposition coding scheme: 

1) Codebook Generation: Generate 2 n/ ( c/;1 ' 2 ) i.i.d Gaussian codewords u n with average power (1 — a)P and randomly 
distribute these codewords into 2 nR ' 1 bins. Then index each bin by w 2 G {1, 2, 2 nR2 }. Generate an independent set of 
2 nI ( x ' Yl ) i.i.d Gaussian codewords x n with average power aP. Then, randomly distribute them into 2 nRl bins. Index each 
bin by w\ G {1, 2, 2 nRl }. 

2) Encoding: To send messages w\ and w 2 , the transmitter randomly chooses one of the codewords in bin w 2 , (say u 11 ) and 
one of the codewords in bin w\ (say x n ). The transmitter then simply transmits x n = u n + x n . 

3) Decoding: The received signal at the legitimate receivers are t/" and y% , respectively. Receiver 2 determines the unique 
u n such that (m™,^) are jointly typical and declares the index of the bin containing u" as the message received. If there is 
none of such or more than one of such, an error is declared. Receiver 1 uses the successive cancelation method; it first decodes 
u n and subtracts it from y" and then looks for the unique x ™ such that (x — u ) are jointly typical and declares the 
index of the bin containing x " as the message received. 

It can be shown that if R\ and i?2 satisfy ( f]~5T > and (fTol i. the error probability analysis and equivocation calculation is 
straightforward and may therefore be omitted. 

Converse: According to the previous section, R 2 is bounded as follows: 



nR 2 < I(Y 2 n ; U n \Z n ) = h(Y 2 n \Z n ) - h(Y 2 n \U r \ Z n ), (35) 
where h is the differential entropy. The classical entropy power inequality states that: 

2 | h (Y?+N' 3 n ) > 2 $HY?) + 2 #' l ( JV 3") 

Therefore, h(Y 2 \Z n ) may be written as follows: 

h{Y 2 n \Z n ) = h(Z n \Y 2 n ) + h(Y 2 n ) ~ h(Z n ) 

= - log 2ve{ai -al) + h(Y 2 n ) - h(Y 2 n + N 3 n ) 



< $ log 2ne(al - a\) + h(Y 2 ) - - \og(2^) + ^e(v\ - a\)). 



^\ og 2Ke{ai-al) + h{Y?)-^ 
On the other hand, for any fixed a G 1Z, the function 



77 9 

/(t,o)=t-|log(2«* + a) 

is concave in t and has a global maximum at the maximum value of t. Thus, h(Y 2 n \Z n ) is maximized when Y 2 n (or equivalently 
X n ) has Gaussian distribution. Hence, 

77 7? 71 

h(Y 2 n \Z n ) < - log 2ve{al - o\) + - log 2ve{P + a 2 2 ) - - log 2ne(P + of) 



? l0g ( 



/27re(a 3 2 -a2)(P + a 2 2 



(36) 



Note that another method to obtain d36t is using the worst additive noise lemma (see [46], [47] for details). Now con- 
sider the term h(Y 2 \U n , Z n ). This term is lower bounded with h(Y 2 n \U n , X n , Z n ) = ^\og2ne(a 2 ) which is greater than 

f log27re(^Hp^). Hence, 

2/2 2 \ 

glog2^e( g2lg3 ~ a2) ) < h{Y 2 n \U n ,Z n ) < h(Y 2 n \Z n ). (37) 
2 er 3 

Inequalities (l36l l and d37l i imply that there exists an a £ [0, 1] such that 



n 



(2ne{al ~al)(aP + a 

' 4 



h(Y 2 n \U n ,Z n ) = - log ' 21 ) . (38, 
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Substituting d38l and d36l l into (l35T l yields the desired bound 



- ^(^)-naf^). (39) 
1 aP + of J \ aP + of ' 

Note that the left hand side of (138]) can be written as h(Y 2 n , Z n \U n ) - h(Z n \U") which implies that 

MKT|LH - h{Z»\U») = \ log (^^1) • (40) 

Since cr^ < a\ < of, there exists a < /3 < 1 such that cr| = (1 — /3)<r^ + /3of, or equivalently, of = a\ + /3(of — of). 
Therefore, since Y™ — > Y 2 ™ — * Z n forms a Markov chain, the received signals Z n and Y 2 ™ can be written as Z n = Y" + N n 
and Y 2 ™ = Y/ 1 + \f/3N n where N is an independent Gaussian noise with variance a 2 = of — of. All noises are Gaussian 
random n-vector with a positive definite covariance matrix. Costa's entropy power inequality [48] states that (see also [49] for 
its linear version), 

2 ih(Y?+VPN n \v n ) > (i _ /3)2" h{Y " lun) + /32"' i(y " + ^" |c/ " ) (41) 

for any random n-vector Y™ and Gaussian random n-vector of N n . Equivalently we have, 

2 £MCT") > (i _ f3)2^r\u n ) + p2$h(Z»\u») (42) 

After some manipulations of (1421 1. we obtain 

h(Y?\U n ) - h(Z n \U n ) 

n. f aP + of - [3{aP + of) 



n faP + af\ 



2'"°l (l-/3)(aP + o 3 2 ) 
n, faP + al 

-2 l0g {^pT4 

The rate i?i is bounded as follows 

nR x < I(X n ;Y{ l \U n ) - I(X n ;Z n ) + I(U n ;Z n ) (44) 
= h{Y?\U n ) - h(Y?\X n , U n ) + h(Z n \X n ) - h{Z n \U n ) 

= h{Y?\U n )-h{Z n \U n ) + ^-\og£\) 

- 

where (a) follows from ( |43l . 

Appendix IV 

Complementary Lemmas for Equivocation Analysis 
Lemma 3 Assume U n , V", V 2 and Z n are generated according to the achievablity scheme of Theorem^ then we have, 

I(Vr,V 2 n ; Z n \U n ) < nI(Vi, V 2 ; Z\U) + nS ln , 
I{V?\ V 2 n \U n ) < nI(V i; V 2 \U) + n6 2n . 

Proof: Let A"(Pjjy ll v 2 ,z) denote the set of typical sequences (U n , V™, V 2 , Z n ) with respect to Pu,v 1 .y 2 .z, and 

' 1, (U n ,V?,V 2 n ,Z n )tA?(P uyuV2 , z ); 
0, otherwise, 

be the corresponding indicator function. We expand /(Vj", V 2 n ; Z n \U n ) as follow, 

704™, V 2 n ; Z n \U n )< I(Vf, V 2 n , C; Z n \U n ) (45) 

= I(Vf, V 2 n ; Z n \U n ,Q + I(C; Z n |t/™) 
i 

= E P (C = j)/(VT, V 2 "; Z"|C/", C = j) + 7(C; 

j=0 



c = 
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According to the joint typicality property, we have 

P(C = l)I(Vr, V 2 n ; Z n \U n , C = 1)< nP((U n , V{\ V 2 n , Z n ) { A^{Pu,v lt v 2 ,z)) log \\Z\\ (46) 

< ne„log||S||. 

Note that, 

I(C;Z n \U n ) < H(0 < 1 (47) 

Now consider the term P(£ = 0)7(1^™, V^ 1 ; Z n \U n , £ = 0). Following the sequence joint typicality properties, we have 

P(( = 0)/(V r 1 ™, y 2 "; Z n |C/™, C = 0)< I(y?, V 2 n ; Z n \U n ,C = 0) (48) 

P(U n ,V?,V 2 n ,Z n )(\ogP(V?,V 2 n ,Z n \U n )-\ogP{V?,V 2 n \U n ) 

(u n ,v? ,vg ,z«)eA? 
-logP(Z"|(7")), 

< n y 2 , Z\U) + H(V U V 2 \U) + H(Z\U) + 3e n ] , 

= n[I(V u V 2 ;Z\U)+3e n ], 

By substituting i46\ , t47\ , and ( |48l into ( [45) , we get the desired reasult, 

/(^, Kf; Z" |[/")< n/(Vi , y 2 ; Z|[7) + n ( e n log ||Z|| + 3e„ + -) , (49) 



= 7i/(y 1 ,^ 2 ;Z|C/) + n5 1 „, 

where, 

Si n = e«log||Z|| + 3e„ + -. 

Following the same steps, one can prove that 

I(Vr;V 2 n \U n ) < nI(Vi;V 2 \U) + nS 2n . (50) 

■ 

Using the same approach as in Lemma [3] we can prove the following lemmas. 

Lemma 4 Assume U n , V", Y" and Y 2 are generated according to the achievablity scheme of Theorem\l\ then we have, 

I{V?;Y?\U n ) < nI(Vi;Yi\U)+nS 3n , 
I{V?;Z n \U n ) < nI(Vi;Z\U)+nS in , 
I(U n ;Z n ) <nI(U;Z)+nS 5n , 
I[U n ;Y?)<nI{U;Yi)+nS 6n 
I(U n ;Y 2 n ) <nI(U;Y 2 )+nS 7n 

Proof: The steps of the proof are very similar to the steps of proof of Lemma [3] and may be omitted here. ■ 
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